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Abstract 

In this paper, linear beamforming design for amplify-and-forward relaying cel- 
lular networks is considered, in which base station, relay station and mobile 
terminals are all equipped with multiple antennas. The design is based on min- 
imum mean-square-error criterion, and both uplink and downlink scenarios are 
considered. It is found that the downlink and uplink beamforming design prob- 
lems arc in the same form, and iterative algorithms with the same structure 
can be used to solve the design problems. For the specific cases of fully loaded 
or overloaded uplink systems, a novel algorithm is derived and its relationships 
with several existing beamforming design algorithms for conventional MIMO or 
multiuser systems are revealed. Simulation results are presented to demonstrate 
the performance advantage of the proposed design algorithms. 
Keywords: Amplify-and-forward (AF), cellular network, multiple- input 
multiple-output (MIMO), minimum mean-square-error, relay station. 
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1. Introduction 



Cooperative communication is a promising technology to improve quality 
and reliability of wireless links . One of the most important 

application scenarios of cooperative communications is cellular network. Due 
to shadowing or deep fading of wireless channels, base station may not be able 
to sufficiently cover all mobile terminals in a cell, especially those on the edge. 
Deployment of relay stations is an effective and economic way to improve the 
communication quality in cellular networks, as shown in Fig. [T] 

In cooperative cellular networks, there are two major strategies in relaying. 
Relay station can either decode the received signal before retransmission [9[ or 
simply amplify-and-forward (AF) the received signal to the corresponding des- 
tination without decoding [lC)l ]. AF strategy has low complexity and minimal 
processing delay, and is more secure. These reasons make AF preferable in 
practical implementation. In fact, deployment of AF relay station with multi- 
ple antenna to enlarge coverage of base station is one of the most important 
components in the future communication protocols, e.g., LTE, IMT- Advanced 
and Winner project [llj , ■ 

With multiple antennas at mobile terminals, relay station and base station, a 
natural question is how to allocate limited power resource in the spatial domain. 
In general, power allocation is equivalent to beamforming matrices design at 
base station, relay and mobile terminals, and the objective can be maximizing 
capacity [13I ] or minimizing the mean-square error (MSE) of the recovered data 



14[. The MSE criterion is a widely chosen one since it aims at the data be 



recovered as accurate as possible, and is extensively used in power allocation 
in classical point-to-point 15,[l3,[l^] or multi-user MIMO systems (lil [lil 20 . 
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23, 



24[ . The MSE minimization is also related to capacity maximization 



22[ if a suitable weighting is applied to different data streams. 
In a cellular network, the base station and relay station are usually allowed to 
be equipped with multiple antennas. For each mobile terminal, if it is equipped 
with single antenna, such relay cellular networks has been investigated from 
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various point-of-views. For example, beamforming design for capacity maxi- 
mization has been considered in [9j, and quality-of-service based power control 
has been investigated in [lol ]. However, in the next generation multi- media 
wireless communications, it is likely that the size of a mobile terminal allows 
multiple antennas to be deployed. Unfortunately, extension from the previous 
works on single antenna mobile terminals to multi-antenna terminals is by no 
mean straightforward. 

In this paper, we take a step further to consider the case where each mobile 
terminal is also equipped with multiple antennas. In particular, we consider the 
joint precoders, forwarding matrix, and equalizers design for both uplink and 
downlink AF relaying cellular network, under power constraints. The design 
problems are formulated as optimization problem minimizing the sum MSE of 
multiple detected data streams. While extension of the presented algorithm to 
weighted MSE criterion is straightforward, we focus on sum MSE for notational 
clarity. The contribution of the paper is as follows. Firstly, in the downlink, 
the precoder at base station, forwarding matrix at relay station and equalizers 
at mobile terminals are jointly designed by an iterative algorithm. Secondly, 
in the uplink case, we demonstrate that the formulation of the beamforming 
design problem has the same form as that in the downlink, and the same iterative 
algorithm can be employed. Thirdly, since the general iterative solution provides 
little insight, we derive another algorithm under the specific case when the 
number of independent data streams from different mobile terminals is greater 
than or equal to their number of antenna. It is found that the resultant solution 
includes several existing algorithms for multi-user MIMO or AF relay network 
with single antenna as special cases. 

The paper is organized as follows. In Section^ beamforming design problem 
in downlink is investigated, and an iterative algorithm is presented. In Section[3J 
the analogy of the uplink and downlink beamforming design problems is demon- 
strated. Furthermore, another beamforming design algorithm is derived for the 
specific case of fully loaded or overloaded system, and the relationships of this 
algorithm with other existing algorithms are discussed. Simulation results are 
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given in Section 2] to demonstrate the effectiveness of the proposed algorithms. 
Finally, conclusions are drawn in Section [5] 

The following notations are used throughout this paper. Boldface lowercase 
letters denote vectors, while boldface uppercase letters denote matrices. The 
notation Z H denotes the Hermitian of the matrix Z, and Tr(Z) is the trace of 
the matrix Z. The symbol 1m denotes an M x M identity matrix, while Om,n 
denotes an M x N all zero matrix. The notation Z 1 / 2 is the Hermitian square 
root of the positive semidefinite matrix Z, such that Z 1 / 2 Z 1 / 2 = Z and Z 1 / 2 
is also a Hermitian matrix. The operation diag{[A B]} is defined as a block 
diagonal matrix with A and B as block diagonal. The symbol £{•} represents 
the statistical expectation. The operation vcc(Z) stacks the columns of the 
matrix Z into a single vector. The symbol <E> denotes the Kronecker product. 
For two Hermitian matrices, C y D means that C — D is a positive semi-definite 
matrix. 

2. Downlink Beamforming Design 

2.1. System model and problem formulation 

On the boundary of a cell, due to shadowing or deep fading, the direct 
link between base station (BS) and mobile terminals may not be good enough 
to maintain normal communication. Then mobile terminals will rely on relay 
station to communicate with BS. As shown in Fig.[2Jt, in downlink, signal is first 
transmitted from the BS to the relay station and then the relay station forwards 
the received signal to the corresponding mobile terminals. It is assumed that the 
BS has Nb antennas and the relay station has Nr antennas. For the fc th mobile 
terminal, it has N&i.k antennas. The BS needs to simultaneously communicate 
with K mobile terminals via a single relay station. There are Lfc data streams 
to be transmitted from the BS to the k th mobile terminal, and the signal for 
the k th mobile terminal is denoted by a x 1 vector s*.. It is assumed that 
different data streams are independent, i.e., Ejs/jS^ 1 } = Oh k ,Lj when k ^ j and 
EjsfcS^} = Ii fc . With separate precoder Tfe for different mobile terminals, the 
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received signal at the relay station is 

r = H BR Ts + 77, (1) 

where Hbr denotes the Nb x Nr channel matrix between the BS and relay 
station, T = [Ti, ■ ■ • , Tk], s = [sj, ■ ■ ■ , s^-] T and the vector 77 denotes the 
additive Gaussian noise with zero mean and covariance matrix R,,. The power 
constraint at the BS is given by ^ fe Tr(TfcT)?) < P s , where P s is the maximum 
transmit power. 

At the relay station, before retransmission the signal r is multiplied with a 
forwarding matrix W under a power constraint Tr(WR r W) H < P r , where P r 
is the maximum transmit power at the relay station and R r is the covariance 
matrix of the received signal r: 

R r = Hsi?TT H H§ fi + Rj,. (2) 

Finally, at the k th mobile terminal, the received signal is 

y fc = n RM ,k whbhTs + n RM , k W77 + v fe , (3) 

where matrix H R M,k is the Nr x Nm,r channel matrix between the relay station 
and the k th mobile terminal, and is the additive Gaussian noise at the k th 
mobile terminal with zero mean and covariance matrix R V(c . 

At each mobile terminal, an equalizer G& is employed to detect the data. 
The mean-square-error (MSE) of data detection at the k th terminal is 

MSE fc (G fc ,W,T fe ) (4) 
= E{||G fe y fe -s fc || 2 } 

= Tr(G fe (H HM , fe WR r W H H* Mjfc + R V JG«) - Tr(G,H M/ , fc WH Bfl T fe ) 
- Tr((G fc H flM , fc WH Bfi T fc ) H ) + Tr(I L J. (5) 

Now defining y = [yj, ••• , y£] T , H RM = [H£ M)1 , ••• , H^ M , K ] T , v 
v : . • • • , v^] T , and G = diag{[Gi, • • • , Ga']}, the sum MSE can be written 
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as 

K 



MSE D (G,W,T) =^MSE fe (G fe ,W,T fc ) 



fe=i 

=Tr(G(H flM WR r W H Hg M + R V )G H ) - Tr(GH M/ WH Bfi T) 
- Tr((GH M/ WH BJ? T) H ) + Tr(I L ), (6) 

where L = J2k=i and R v = diag{[R Vl , • • • , R VK ]}- 

Therefore, the downlink beamforming optimization problem can be formu- 
lated as 

min MSEi3(G,W,T) 

G,W\T 

s.t. Tr(TT H ) < P a 

Tr(WR r W H ) < P r 

G = diag{[G ll ■•• , Gjf]}. (7) 

The optimization problem |(7j) is a nonconvex optimization problem for T, 
W and G, and there is no closed- form solution. This challenge remains even 
for the special case of multiuser MIMO systems [llj , , [3] where only single 
hop transmission is involved. However, notice that when two out of the three 
variables are fixed, the optimization problem ([7]) for the remaining variable is 
a convex problem, and thus can be solved. Therefore, an iterative algorithm 
alternating the design of three variables can be employed. 

2.2. Proposed iterative algorithm 

(1) Equalizer design at the destination 

When T and W are fixed, the optimization problem ((7|) is an unconstrained 
convex quadratic optimization problem for G. Furthermore, since the structure 
of G is block diagonal, the design of individual Gfc are decoupled. Therefore, 
the necessary and sufficient condition for the optimal solution is 

9E fe MSE fc (G fc ,W,T fc x 



dGt 



(8) 



G 



and the optimal equalizer for the fc th mobile terminal can be easily shown to be 
G k = (H flia WH BJi T t ) H (H fflJi WR r W H HH i(t + r v J-i. (9) 

(2) Forwarding matrix design at the relay station 

When T and G are fixed, the optimization problem ([7]) is a constrained 
convex optimization problem for the variable W, and the Karush-Kuhn- Tucker 
(KKT) conditions are the necessary and sufficient conditions for the optimal so- 
lution 25j . The KKT conditions of the optimization problem ([7]) with respective 



to W are 

H£ A/ G H GH M/ WR r + AWR r = (H Bfl TGH M1 ) H (10) 

A(Tr(WR r W H ) - P r ) = 0, A > 0, (11) 

Tr(WR r W H ) < P r , (12) 

where A is the Lagrange multiplier. 

Based on the first KKT condition (fT0|) . the optimal forwarding matrix W 
can be written as 

W = (Hl M G H GH RM + ai)- 1 (h bk tgh^m) h R i : 1 , (13) 

where the value of A is computed using (fTTj) and (fT2"j). Since A also appears in 
W, (1111) and (|12[) depends on A in a nonlinear way and there is no closed-form 
solution. Below, we propose a low complexity method to solve (fTTj) and (fT2j) . 

First, notice that in order to have f) 1 1 [) satisfied, either A = or Tr(WR r W H ) 
P r must hold. If A = also makes (|T2"j) satisfied, A = is a solution to (fTTj) and 
(fT2|) . On other hand, if A = does not make (fT2|) satisfied, we have to solve 
Tr(WR r W H ) = P r . It can be proved that Q when T and G arc fixed, the 
function /(A) = Tr(WR r W H ) is a decreasing function of A and the range of A 
must be within 



0<»<^ (14) 

where E = ^ fc {(Hs/;TfcGfcH/;j\/.fc) H }- Therefore, A can be efficiently com- 
puted by one-dimension search, such as bisection search or golden search. Since 
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Tr(WR r W H ) = P r is a stronger condition than Tr(WR r W H ) < P r , (O is 
satisfied automatically in this case. In summary, A is computed as 

if/ros* (l5) 

Solve /(A) = P r using bisection algorithm Otherwise 

(3) Precoder design at the BS 

When W and G are fixed, the optimization problem can be straightfor- 
wardly formulated as the following convex quadratic optimization problem for 
the precoder T 

nun Tr(N^T H AoTNo) + 2ft{Tr(B 1 T)} + c 
s.t. Tr(NfT H AiTNi) + 2ft{Tr(B?T)} + d < 0, 

Ti-(N^T H A 2 TN 2 ) + 2ft{Tr(B 2 1 T)} + c 2 < 0, (16) 

where the corresponding parameters are defined as 

A = H^W H HH M G H GH flM WH Biil A x = I, A 2 = U%W H WH BR , 

B^ = — GH^fWHsfl, Bi = B 2 = 0, 

N =Ni =N 2 =I L , 
c = Tr(R, 7 W H HH M G H GH fiA/ W) + Tr(I L ) + Tr(GR v G H )) 
ci = -P s , c 2 = Tt(WR v W h ) - P r . (17) 

Notice that the objective function and the constraints arc of the same form. 
Using the property Tr(AB) = vec H (A H )vec(B) and the property of Kronecker 
product, we can write (I = 0, 1, 2) 

Tr(NfT H A ; TNi) =Tr(NfT H Af Af TN Z ) 

=vec H (Af TN, ) vec( Af TN* ) 

=vec H (T)(N ; * ® Af)(Nj g> Af )vec(T), (18) 

where the first equality is based on the fact that A;'s are positive semidefinite 
matrices. Furthermore, we can also write Tr(BpT) = vec H (B[ i )vec(T). Putting 
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these two results into ([T5| and after introducing an auxiliary variable t [28( , (|16p 
is equivalent to the following optimization problem 



min t 



s.t. 



vec H (T)(NS 
vec H (T)(Nt 
vcc H (T)(N; 



) AfXNj® A ^)vec(T) <t 

n 

3 A x 2 



)(N? 



_H_ 

) A 9 2 



3 \\ )vec(T) < 
3 A|)vcc(T) < 



2^{vec H (B^)vec(T)} 
2^{vcc H (Bf)vec(T)} 
vec H (B^)vcc(T)}. 



-ci - 



-c 2 - 2ft{ 



(19) 



Since Cq does not affect the optimization problem, it has been neglected in ([T 



With the Schur complement lemma 



32| , the optimization problem (|19[) can 



be further reformulated as the following semi-definite programming (SDP) prob- 



lem 



28 



min t 
T.t 



s.t. 



((N^Aj)vec(T)) H 



(Nj® A *)vcc(T) 
-2^{vcc H (B )vec(T)} + t 

I (N7® Af)vec(T) 

((N^ <g) Af )vec(T)) H -2^{vec H (B;)vec(T)} - q 



>- 



>-0, J = 1,2. 



(20) 



The precoder at the BS is designed by solving this SDP problem using standard 
numerical algorithms such as interior-point polynomial algorithms 2(3] , 2s| . 

2.3. Summary and Initialization 

In summary, the downlink beamforming matrices are computed iteratively. 
Since in each iteration, the MSE monotonically decreases, the iterative algo- 
rithm is guaranteed to converge to at least a local optimum. For initialization, 
identity matrices can be chosen as initial values due to its simplicity and bet- 
ter performance compared to randomly generated initial matrices [3] , [ljj] , . 
On the other hand, we can also use a suboptimal design by viewing the downlink 
dual-hop AF MIMO relay cellular networks as a combination of conventional 
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point-to-point MIMO system in the first hop, and multiuser MIMO downlink 
system in the second hop. More specifically for the first hop, the linear mini- 
mum mean-square-error (LMMSE) precoder T at BS and equalizer Wi at relay 
station can_ be jointly designed using the point-to-point water-filling solution 
given in 



171 ]. For the second hop, the precoder W2 at relay station and equal- 
izer G at mobile terminals can be designed using the bcamforming algorithm for 
multiuser MIMO systems proposed in Q. Based on the results of Wi and Wj, 
the forwarding matrix at relay station equals to W = W1W2. We refer this 
suboptimal algorithm as 'separate LMMSE transceiver design'. It will be shown 
in Simulation section that the convergence speed using the second initialization 
is better than that of the first one. Finally, the iterative design procedure is 
formally given by 
Algorithm 1 

With initial G°, W° and T°, the algorithm proceeds iteratively and in each 
iteration: 

(1) G is updated using ((9j); 

(2) W is updated using (JT3J) and (fT5]): 

(3) T is updated by solving pO]) . 

The algorithm stops when ||MSE^ - MSE^ +1 || < 7b, where MSE^ is the total 
MSE in the I th iteration and Tb is a threshold value. 



3. Uplink Beamforming Design 

3.1. System model and analogy with downlink design 

In this section we will focus on beamforming matrices design for uplink, 
as shown in Fig. ^Bp. In uplink, there are Lk data streams to be transmitted 
from the k th mobile terminal to the BS, and the signal from the k th mobile 
terminal is denoted as Sfe. Without loss of generality, it is assumed that the 
transmitted data streams arc independent: EjstsJ 1 } = OL k ,Lj when k 7^ j and 
E{sfeS^} = Ii fc . At the k th mobile terminal, the transmit signal is multiplied 
by a precoder matrix Pj, under a power constraint Tr(PfcPj?) < P s ,k, where P Sj fe 
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is the maximum transmit power at the k mobile terminal. The received signal 
x at the relay station is the superposition of signals from different terminals 
through different channels and is given by 

x = H MR Ps + n, (21) 

where H. M r - \H-mr,i • • • H^], P = diag{[P x , • • • , Pa-]}, s = [sj ■■■ s£] T , 
with H^jB.k being the N R x Nm,u channel matrix between the k th mobile termi- 
nal and relay station, and n is the additive Gaussian noise at the relay station 
with zero mean and covariance matrix R n . Since the data transmitted from 
different mobile terminals are independent, the correlation matrix of x equals 
to 

R x = n MR PP u H MR + R n . (22) 

At the relay station, the received signal x is multiplied with a linear for- 
warding matrix F, with a power constraint Tr(FR x F H ) < P r , where P r is the 
maximum transmit power at the relay station. Finally, the received signal at 
the BS is 

y = HRsFHjvfflPs + H&gFn + £, (23) 

where H R b is the N R x N R channel matrix between the relay station and BS, 
and £ is the additive zero mean Gaussian noise with covariance R^. 

When a linear equalizer B is adopted at the BS, the total MSE of the detected 
data is 

MSE C ,(B,F,P) =E{||By-sj| 2 } 

=Tr(B(H flfl FR x F H H* B + R € )B H ) - Ty(BU rb FH mr P) 
- Tr((BH KB FH ra P) H ) + Tr(I L ), (24) 

where L = X^fcLi i s the total number of data streams. Finally, the optimiza- 
tion problem for beamforming matrices design in the uplink case is formulated 
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MS 



mm 

B,F,P 



MSEtrCB.P.P) 



s.t. 



Tr(P fe P») < P 3 , k , k = l,---,K 



Tr(FR x F ) < P r 



P = diag{[Pi, ••• , Pk)}- 



(25) 



Comparing (f2~5j) with the downlink problem (O, it can be seen that the two 
problems are in the same form, except that i) there are individual constraints 
on Pfe in (|25|) instead of a sum constraint on the corresponding Tfc in ([7]), and ii) 
the diagonal structure constraint is on precoder instead of equalizer. However 
we can still employ the iterative algorithm developed in the previous section 
for this uplink beamforming design problem. More specifically, for equalizer B 
design, the problem is an unconstrained convex optimization problem and the 
optimal solution can be directly computed from the derivative of the objective 
function. For forwarding matrix F design, the problem is a convex quadratic 
optimization problem with only one constraint. In this case, the optimal solution 
can be solved based on KKT conditions. Finally, for precoder P design, the 
problem is a convex quadratic optimization with multiple constraints, which 
can be transformed into a standard SDP problem. Notice that a SDP problem 
can handle any number of linear matrix inequality constraints and the diagonal 
structure of P does not affect the SDP problem. 

Although the optimization problem (|25[) can be solved using an iterative 
algorithm alternating the three variables B, F and P, this solution provide 
little insight into the nature of the problem. Below we consider the fully loaded 
or overloaded MIMO systems in which the number of independent data streams 
from mobile terminals is greater than or equal to the number of its antennas, 
i.e., Nm,h < Lk [2], [jjcj]. The solution is found to be insightful and includes 
several existing algorithms for conventional AF MIMO relay or multiuser MIMO 
as special cases. 
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3.2. Uplink beamforming design for fully loaded or overloaded systems 

First, wc reduce the number of variables of the optimization problem. Notic- 
ing that there is no constraint on B, the optimal B satisfies 9MSE;y (B, F, P)/ <9B* = 
®l,n b , and the optimal equalizer at the BS can be written as a function of for- 
warding matrix and precoder matrix. Therefore B = (HflBFHMKP) H (Hi?BFR x F H H^. B + 
R^) -1 . Substituting this result into fM§, the uplink MSE is simplified as 

MSE,y(F,P) 

= Tr(I L ) - Tr((H fiB FH Mfi P) H (H^FR x F H Hi B + R$ )~ 1 (H-rb FHmrP)) ■ 

(26) 

Based on the definition of R x = Hj\//?PP H H^ /fl + R n , it can be expressed 

as 

R x = Ry 2 (R- 1 / 2 H M flPP H H^ fl R n 1 / 2 + I)R^ /2 . (27) 

_A ,_, 

Now introducing F = FRy 2 H 1/2 , the MSE becomes 

MSEir(P,P) =Tr(I L ) - Tr((H^ B FH- 1 / 2 R n 1 / 2 H A/H P) H 

x (H M FF H Hi B +R € )- 1 (H KB FH- 1/2 R n 1 / 2 H A/ flP)). (28) 

Thus the uplink beamforming design optimization problem (|25[) is rewritten as 
min MSEc/(F,P) 

F,P 

s.t. Tr(P fc Pf) < P Sik , fe = l,... ,K 
Tr(FF H ) < P r 

P = diag{[P 1 , ••• , P K }}. (29) 

Unfortunately, the optimization problem (|2"9")l is still nonconvex for F and P, 
and thus there is no closed-form solution. However, notice that if either F or 
P is fixed, the optimization problem is convex with respect to the remaining 
variable. Therefore, an iterative algorithm which designs F and P alternatively, 
is proposed as follows. 
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(1) Design F when P is fixed 

From (f28f , it is noticed that F appears both inside and outside of the inverse 
operation. In order simplify the objective function, we use the following variant 
of matrix inversion lemma 



C H (CC H + D^C = I (C^D^C + I) -1 . 



(30) 



Taking C = U RB F and D = R € , the MSE ([5SJ can be reformulated as [27 1 
MSE^F, P) =Tr((S- 1 / 2 R-i/2 HMi?P)(s -i/2 R -i/2 HMfl p ) H 

x (F H Hg B R € 1 HhbP + I)" 1 ) + Tr((P H Hj^- i? R~ 1 Hjvfi?P + I)" 1 ). 

(31) 

Now, F only appears inside the matrix inverse. If P is fixed, the last term of 
(|3"Tj) is independent of F, and the optimization problem ([29)) becomes 

min Tr((H- 1 /2R-i/2H Mfl P)(S- 1 /2 R -i/2 HMfl P)H(F H H&jR^HjtB F + I)" 1 ) 

p « v ' * : ' 



=M 



s.t. Tr(FF H ) < P r . 



(32) 



Based on cigen-dccomposition, = U©A©U@ and M = UmAmU^, and 
defining 



A^UpUe, 

the optimization problem (|32[) can be simplified as 

min Tr(A©(A|A M A f + 1)" 1 ) 
p 

s.t. Tr(A f A|) < P r . 



(33) 



(34) 



Without loss of generality, the diagonal elements of A© and Am are assumed to 
be arr ang ed in decreasing order. The closed-form solution of (|34l) can be shown 



to be \2d 



i r-i/2ri/2 a _1n + 

/^j A M A © A M 



>N R -L,L 



-,1/2 



Ol,n r -l 
0N n -L.N R -L 



(35) 
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where A© and Am are the L x L principal submatrices of A© and Am, respec- 
tively. The scalar fj,f is the Lagrange multiplier which makes Tr(A-g,A~5) = P r 
hold. Based on (|3"3"|) and (|35[) . the optimal F can be recovered as 



U 



1 r— 1/2 r 1/2 -— 1 



A - A M 



1 1/2 



u 



0,L' 



(36) 



where Um,l and U©^ are the first L columns of Um and U©, respectively. 
Finally, the optimal F is given by F = FE^^Rn . 
(2) Design P when F is fixed 



Since S in (|28f depends on P, the MSE expression in (|28|) is a complicated 
function of P, direct optimization of P seems intractable. However, based on 
the property of trace operator Tr(DC) = Tr(CD), the total MSE <(2S]) can be 



reformulated as 



31 



MSE C/ (F,P) 

=Tr(I L ) - Tr((H flB F) H (H HB FF H H£ B + R^)" 1 

x H^FXH- 1 / 2 R n 1 / 2 H M ^PP H Hf f fl R n 1/2 3^ /2 )) 



=Tr(I L ) - Tr((H fl flF) H (H iJB FF H Hg fl + R € )- 1 H KB F)(I JVj? - H" 1 )). (37) 
Substituting the definition of H into (f37|) . the MSE can be further rewritten as 



MSEc/(F,P) 
=Tr((H flB F) H (H flB FF H HH B + R^-^H^F) 

V v ' 

x (R^ 1 / 2 HMflPP H H^ fl R I ^ ly ' 2 + Ijv h ) _1 ) 
+ Tr(I L ) - Tr((H i?B F) H (H fiB FF H Hg B + R^^H^F)), (38) 

where P only appears inside of the inverse operation. As the last two terms of 
are independent of P, the optimization problem for P is 

min Tr(n(R n 1 / 2 H Mfl PP H H^ fi ,R n 1 / 2 + I^)" 1 ) 

Tr(P k Pf) < P s>k k = 1, ■ ■ ■ , K 

P = diag{[Pi, ••• , Pk}}- (39) 



p 

s.t. 
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With the definitions of Hmr and P, 

K 



fc=1 =Q fc 
Putting ((301) into (f3U)) . the optimization problem becomes 

if 

min Tr(II(R n 1 / 2 £{H A „ l , fc Q ik H& ^R?'* + IjvJ" 1 ) 
Qk fc=i 



(40) 



s.t. Tr(Q fc ) < P a>fc , fc = 1 

Q fc >= o. 



(41) 



Using the Schur-complement lemma 



32| , the optimization problem (|4T|) can be 



further formulated as a standard SDP optimization problem 



31] 



min Tr(X) 

X,Q fc 



S.t. 



x n 1 / 2 

II 1 / 2 R n '' 2 ^ fc {H A /^ ife Q fe H^ /i? fe }R r 
Tr(Q fe )<P S)fe , fc=l,...,Jf 
Qfc to. 



1/2 



>- 



(42) 



The SDP problems can be efficiently solved using interior-point polynomial al- 
gorithms [26( | . 

In summary, when Njtf,k < L^, the uplink beamforming design alternates 
between the design of F in (f36| and Qk in (|42| . The algorithm stops when 



|MSE^ 



MSE 



<Tu, where MSE^ is the total MSE in the I th iteration 

j/2 *" 1/2 1/2 

and 7j/ is a threshold value. After convergence, Pfc = Q fc , F = FH ' R n ' 
and B = (H i?B FH MK P) H (H flB FR x F H Hg B + R € )" 1 . We refer the algorithm 
in this section as Algorithm 2. 

Remark 1: In case Nm,U > Lk, there is an additional constraint RankjQk} < 
N k in igU). In this case, as rank constraints are nonconvcx, transition from 
(j4Tj) to (14211 involves a relaxation on the rank constraint. Then the objective 
function of (|4"2"|) is a lower bound of that of (|4"Tj) . However, this problem seems 
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to be common to all multiuser MIMO uplink beamforming 2l| . [23J. Notice 
that when Nm,u < Lk, there is no relaxation involved. 

3.3. Special cases 

Notice that (f31)| has a more general form than the water-filling solution in 
traditional point-to-point MIMO systems. On the other hand, PS|) is a SDP 
problem frequently encountered in multiuser MIMO systems. In particular, they 
include the following existing algorithms as special cases. 

• If Hrb = II and = Ol.l, we have II = II in flTj) , and the SDP optimiza- 



tion problem (f4"2"j) reduces to that of the uplink multiuser MIMO systems 21 1 



231 ] . Therefore, they have the same solution. 

• Substituting K — 1 and P = into (piBl) . it reduces to the solution proposed 
for LMMSE joint design of relay forwarding matrix and destination equalizer in 
AF MIMO relay systems without source precodcr [3]. 

• Notice that when there is only one mobile terminal (K = 1), the optimiza- 
tion problem (|39p is in the same form as (|3"2")l . Defining H^^R^HMi?, = 
XJMR-h-MRjJ^tfR, arid II = UnAnU^, a closed-form solution can be derived 
using the same procedure as for F, and we have 

1/2 



P = U 



MR,L 



1 - -1/2 -1/2 ~-l X 



(43) 



where the Amr and An are the L x L principal submatrices of Amr. and An, 
respectively, and the matrix \J mr,l is the first L columns of XJmr- The scalar 
Hp is the Lagrange multiplier which makes Tr(PP H ) = P s ,i hold. In this case, 
the solution given by (f4"3"]) corresponds to the source precoder design for AF 
MIMO relay systems with single user 

• Furthermore, substituting Hrb = II and = Ol.l into (|43l) . it becomes 
the closed-form solution for LMMSE transceiver design in point-to-point MIMO 



systems 



17] 
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4. Simulation Results and Discussions 



In this section, we investigate the performance of the proposed algorithms for 
downlink and uplink. In the simulations, there is one BS, one relay station and 
two mobile terminals. For each mobile terminal, two independent data streams 
will be transmitted in the uplink (or received in the downlink) simultaneously. 
For each data stream, 10000 independent QPSK symbols are transmitted. The 
elements of MIMO channels between BS and relay station and between relay 
station and mobile terminals are generated as independent complex Gaussian 
random variables with zero mean and unit variance. Each point in the following 
figures is an average of 500 independent channel realizations. In order to solve 
SDP problems, the widely used optimization matlab toolbox CVX is adopted 



33[. The thresholds for terminating the iterative algorithms are set at Tb 



Tu = 0.0001. 

First, let us focus on the downlink. In downlink, the noise covariance ma- 
trices at relay station and mobile terminals are = <j^1n h and R Vl = R V2 = 
<j11n m , respectively. We define the first hop SNR at the relay station as P s /a v 2 , 
and the second hop SNR at mobile terminals as P r /a 2 . Fig. [3] shows the conver- 
gence behavior of the proposed Algorithm 1 for downlink with different second 
hop SNR at mobile terminals when Nb — 4, Nr = 4, Nm,h — 2. Both initial- 
izations with identity matrices and the separate LMMSE design are shown. It 
can be seen that the proposed algorithm converges quickly, within 20 iterations. 
Furthermore, the convergence speed with separate LMMSE design as initializa- 
tion is faster than that with identity matrices. It can also be seen that the two 
initializations result in the same MSE after convergence. 

Fig.[4]compares the total data MSEs of the proposed Algorithm 1 and several 
suboptimal algorithms versus the first hop SNR P s /af r The second hop SNR at 
mobile terminals is fixed to be 20dB. The number of antennas is set as Nb = 4, 
Nr = 4 and Nm,u = 2. The suboptimal algorithms under consideration are 
• Direct amplify-and-forward, in which the precoder T at BS and forwarding 
matrix W at relay are proportional to identity matrices. At mobile terminals, 
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LMMSE equalizer for the combined first hop and second hop channel is adopted 
to recover the signal . 

• The first hop channel is equalized at relay and then the second hop channel 
is equalized at mobile terminals, both with LMMSE equalizers. 

• Separate LMMSE design proposed for initialization of Algorithm 1. 

From Fig. [U it can be seen that as there is no precoder design at BS for the first 
two suboptimal algorithms, the data streams at different terminals cannot be 
efficiently separated by linear equalizers, resulting in poor performances. The 
separate LMMSE transceiver design has a much better performance. On the 
other hand, the proposed Algorithm 1 has the best performance among the four 
algorithms. The gap between the MSEs of the separate LMMSE design and 
that of Algorithm 1 is the performance gain obtained by additional iterations. 

As the proposed Algorithm 1 involves a computational expensive SDP for the 
precoder T design, it is of great interest to investigate how much degradation 
would result from skipping the precoder design. Fig. [5] compares the total 
data MSEs of the proposed Algorithm 1 and the same algorithm but fixing the 
precoder T oc I. The second hop SNR at mobile terminals P r /<J 2 is fixed to be 
20dB. From Fig.[5j it can be seen that a properly designed precoder significantly 
improves the system performance when the first hop SNR is high. Without the 
precoder, the data MSEs exhibit error floors at much lower P s /cri, On the 
other hand, we can also see that increasing the number of antennas at the relay 
station greatly improves the system performance, as it simultaneously increases 
the diversity gain of the two hops. 

Now, let us turn to the results in the uplink. In uplink case, the noise 
covariance matrices at relay station and BS are R n = (t„Ijv r and = cr|ljv B , 
respectively. We define the fist hop SNR at the relay station as P s /a n 2 , where 
P s = ^2k—iPs,k- The second hop SNR at the BS is defined as P r ja 2 ^. 

Fig.[B]shows the convergence behavior of the proposed Algorithm 2 for uplink 
when Nb = 4, Nr = 4 and Nm,h = 2. Notice that in this case, at each 
mobile terminal the number of antennas equals to that of the data streams, 
and Algorithm 2 involves no relaxation. The initialization is identity matrices. 
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It can be seen that Algorithm 2 converges very fast, indicating its superior 
performance. 

Fig. [7J shows the total data MSEs of the proposed Algorithm 2 and subopti- 
mal algorithms, when Nb = 4, Nr = 4, Nm,u = 2 and the SNR at relay station 
Ps/&n is fixed to be 20dB. The suboptimal algorithms arc similar to those for 
the downlink. In particular, we consider 

• Equalization of the equivalent two-hop channel is applied only at the BS. 

• Equalization is applied at relay station for the mobile-to-relay channel, and 
also at BS for the relay-to-BS channel. 

• Separate LMMSE design. The first hop is considered as a traditional multiuser 
MIMO uplink system, and the bcamforming matrices arc designed using the 



algorithms in 



191 and 
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The second hop is considered as a point-to-point 



MIMO system, and the beamforming matrices are designed using the result in 



17|. 



From Fig. [7J it can be seen that the performance of the proposed Algorithm 2 is 
better than other suboptimal algorithms. However, as the signals from different 
terminals are cooperatively detected at BS, the gaps between the performance of 
the suboptimal algorithms from that of Algorithm 2 is much smaller compared 
to their counterparts in downlink. 

When Lk < Nm,u m the uplink, strictly speaking, Algorithm 2 involves a 
relaxation, and its performance is not guaranteed. However, a simple variation 
of Algorithm 1 can be used for beamforming design in this case. Fig.|S]shows the 
total data MSEs of Algorithm 1 for uplink and Algorithm 2 with rank relaxation, 
when L k = 2 and N M ,k = 4. The SNR at BS is fixed at P r /<r|=20dB. The joint 
relay forwarding matrix and destination equalizer design in [3[ is also shown for 
comparison. It can be viewed as a design without source precoders at mobile 
terminals. From Fig. [FJ it can be seen that Algorithm 1 and Algorithm 2, 
which involve the joint design of precoder, forwarding matrix and equalizer 
perform better than the algorithm in s|. This indicates the importance of 
source precoder design in AF relay cellular networks. Furthermore, although 
Algorithm 2 involves a relaxation, its performance is still satisfactory, and is 
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close to that of Algorithm 1. Finally, it can also be concluded that increasing 
the number of antennas at relay station can greatly improve the performance of 
uplink bcamforming design for all algorithms. 

5. Conclusions 

In this paper, LMMSE beamforming design for amplify-and-forward MIMO 
relay cellular networks has been investigated. Both uplink and downlink cases 
were considered. In the downlink, precoder at base station, forwarding matrix 
at relay station and equalizer at mobile terminals were jointly designed by an 
iterative algorithm. On the other hand, in the uplink case, we demonstrated 
that in general the beamforming design problem can be solved by an iterative 
algorithm with the same structure as in the downlink case. Furthermore, for the 
fully loaded or overloaded uplink systems, a novel beamforming design algorithm 
was derived and it includes several existing algorithms for conventional point- 
to-point or multiuser systems as special cases. Finally, simulation results were 
presented to show the performance advantage of the proposed algorithms over 
several suboptimal schemes. 

References 

[1] A. Scaglione, D. L. Goeckel, J. N. Laneman, Cooperative communications 
in mobile Ad Hoc networks, IEEE Signal Process. Magaz. 23 (5) (2006) 
18-29. 

[2] J. N. Laneman, D. N. C. Tse, G. W. Wornell, Cooperative diversity in 
wireless networks: Efficient protocols and outage behavior, IEEE Trans. 
Inf. Theory 50 (12) (2004) 3062-3080. 

[3] W. Guan, H. Luo, Joint MMSE transceiver design in non-regenerative 
MIMO relay systems, IEEE Commun. Lett. 12 (7) (2008) 517-519. 

[4] X. Tang, Y. Hua, Optimal design of non-regenerative MIMO wireless relays, 
IEEE Trans. Wireless Commun. 6 (4) (2007) 1398-1407. 



21 



[5] Y. Rong, X. Tang, Y. Hua, A unified framework for optimizing linear 
non-regenerative multicarrier MIMO relay communication systems, IEEE 
Trans. Signal Process. 57 (12) (2009) 4837-4851. 

[6] A. S. Behbahani, R. Mcrched, A. J. Eltawil, Optimizations of a MIMO 
relay Network, IEEE Trans. Signal Process. 56 (10) (2008) 5062-5073. 

[7] H. Bolcskei, R. U. Nabar, O. Oyman, A. J. Paulraj, Capacity scaling laws 
in MIMO relay networks, IEEE Trans. Wireless Commun. 5 (6) (2006) 
1433-1443. 

[8] B. Wang, J. Zhang, A. Host-Madscn, On the capacity of MIMO relay 
channels, IEEE Trans. Inf. Theory 51 (1) (2005) 29-43. 

[9] C.-B. Chae, T. W. Tang, R. W. Health, S.-Y. Cho, MIMO relaying with 
linear processing for multiuser transmission in fixed relay networks, IEEE 
Trans. Signal Process. 56 (2) (2008) 727-738. 

[10] R. Zhang, C. C. Chai, Y.-C. Liang, Joint beamforming and power con- 
trol for multiantenna relay broadcast channel with QoS constraints, IEEE 
Trans. Signal Process. 57 (2) (2009) 726-737. 

[11] S. Stcfania, I. Toufik, M. backer, LTE, the UMTS Long Term Evolution: 
From Theory to Practice, Wiley, 2009. 

[12] A. Osseiran, etc, The road to IMT-advanced communication systems: 
state-of-the-art and innovation areas addressed by the WINNER + project 
- [topics in radio communications], IEEE Communication Magazine 47 (6) 
(2009) 38-47. 

[13] I. E. Tclatar, Capacity of multi-antenna Gaussian channels, European 
Trans, on Telecommu. 10 (6) (1999) 585-595. 

[14] S. Kay, Fundamental of Statistical Signal Processing: Estimation Theory, 
Englewood Cliffs, NJ: Prentice-Hall, 1993. 



22 



[15] E. G. Larsson, P. Stoica, Space-Time Block Coding for Wireless Commu- 
nications, Cambridge University Press, 2003. 

[16] D. Tse, P. Viswanath, Fundamentals of Wireless Communication, Cam- 
bridge University Press, 2005. 

[17] H. Sampath, P. Stoica, A. Paulraj, Generalized linear prccodcr and de- 
coder design for MIMO channels using the weighted MMSE criterion, IEEE 
Trans. Commun. 49 (12) (2006) 2198-2206. 

[18] J. Zhang, Y. Wu, S. Zhou, J. Wang, Joint linear transmitter and receiver 
design for the downlink of multiuser MIMO, IEEE Commun. Lett. 9 (11) 
(2005) 991-993. 

[19] S. Shi, M. Chubert, H. Boche, Rate optimization for multiuser MIMO 
systems with linear processing, IEEE Trans. Signal Process. 56 (8) (2008) 
4020-4030. 

[20] S. Serbetli, A. Yener, Transceiver optimization for multiuser MIMO sys- 
tems, IEEE Trans. Signal Process. 52 (9) (2004) 214-226. 

[21] M. Codreanu, A. Tolli, M. Juntti, M. Latva-aho, Joint design of Tx-Rx 
bcamforming in MIMO downnlink channel, IEEE Trans. Signal Process. 
55 (9) (2007) 4639-4655. 

[22] S. S. Christensen, R. Agarwal, E. dc Carvalho, J. M. Cioffi, Weighted 
sum-rate maximization using weighted MMSE for MIMO-BC bcamforming 
design, IEEE Trans. Wireless Commun. 7 (12) (2008) 4792-4799. 

[23] R. Hunger, M. Joham, W. Utschick, On the MSE-Duality of the broadcast 
channel and multiple access channel, IEEE Trans. Siganl Process. 57 (2) 
(2009) 698-713. 

[24] S. Shi, M. Chubert, H. Boche, Downlink MMSE transceiver optimization 
for multiuser MIMO systems: Dulaity and sum-MSE minmization, IEEE 
Trans. Signal Process. 55 (11) (2007) 5436-5446. 



23 



[25] A. Beck, Quadratic matrix programming, SIAM Journal on Optimization 
17 (4) (2007) 1224-1238. 

[26] S. Boyd, L. Vandenberghe, Convex Optimization, Cambridge University 
Press, 2004. 

[27] C. Xing, S. Ma, Y.-C. Wu, Robust joint design of linear relay precodcr 
and destination equalizer for dual-hop amplify-and-forward MIMO relay 
systems, IEEE Trans. Signal Process. 58 (4) (2010) 2273-2283. 

[28] L. Vandenberghe, S. Boyd, Semidefinite programmming, SIAM Review 
38(1)(1996) 49-95. 

[29] K.-K. Wong, A. Paulraj, R. D. Murch, Efficient high-performance decoding 
for overloaded MIMO antenna systems, IEEE Trans. Wireless Commun. 6 
(5) (2007) 1833-1843. 

[30] R. de Miguel, V. Gardascvic, R. R. Muller, F. F. Knudsen, On overloaded 
vector precoding for single-user MIMO channels, IEEE Trans. Wireless 
Commun. 9 (2) (2010) 745-753. 

[31] C. Xing, Linear Mean-Square-Error Transceiver Design for Amplify-and- 
Forward Multiple Antenna Relaying Systems, Ph.D. Thesis, The University 
of Hong Kong, Hong Kong, July 2010. 

[32] R. A. Horn, C. R. Johnson, Matrix Analysis, Cambridge University Press, 
1985. 

[33] M. Grant, S. Boyd, Y. Y. Ye, CVX: Matlab Software for Disciplined 



Convex Programming, available at: http://www.stanford.edu/boyd/cvx 
V.1.0RC3, 2007. 



24 



u 

L 



_ „ I 1 Efficient r 

Cell area 

I 1 covarage area L 



Figure 1: Amplify- and- forward MIMO relaying cellular network. 
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Figure 2: Amplify-and-forward MIMO relaying downlink and uplink cellular systems. 




Figure 3: The convergence behavior of the proposed Algorithm 1 when Nb = 4, Nr = 4 and 
Nm k — 2 with 2 users. 
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Figure 4: Total MSEs of detected data of the proposed Algorithm 1 and suboptimal algo- 
rithms, when N B = 4, N R = 4, N M ,k = 2 and Pr/^=20dB. 




Figure 5: Total MSEs of detected data of the proposed Algorithm 1 with and without precoder 
design. 
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Figure 6: The convergence behavior of Algorithm 2 for uplink when Nb = 4, Nr = 4 and 
N M ,k = 2. 
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Figure 7: Total MSEs of detected data of Algorithm 2 and sub-optimal algorithms, when 
N B =4, N a = 4, JV M t = 2 and P s /al=20dB. 
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